Homology Search Methods

Authors

  • Daniel G. Brown
  • Ming Li
Abstract

Homology search methods have advanced substantially in recent years. Beginning with the elegant Needleman-Wunsch and Smith-Waterman dynamic programming techniques of the 1970s, algorithms have been developed that were appropriate for the data sets and computer systems of their times. As data sets grew, faster but less sensitive heuristic algorithms, such as FASTA and BLAST, became a dominant force in the late 1980s and 1990s. As data sets have grown still larger in the post-genome era, new technologies have appeared to address the resulting problems. For example, the optimal spaced seeds of PatternHunter increase both speed and sensitivity. Using these ideas, we can achieve BLAST-level speed with sensitivity approaching that of slower algorithms such as Smith-Waterman, bringing us full circle. We wish to take you with us on this round trip, with some detours along the way to study both global and local alignment. We present widely adopted methods for general-purpose homology search rather than individual programs.

Similar resources

Computational Identification of Micro RNAs and Their Transcript Target(s) in Field Mustard (Brassica rapa L.)

Background: Micro RNAs (miRNAs) are a pivotal part of non-protein-coding endogenous small RNA molecules that regulate the genes involved in plant growth and development, and respond to biotic and abiotic environmental stresses posttranscriptionally. Objective: In the present study, we report the results of a systemic search for identification of new miRNAs in B. rapa using homology-based ...

Exploring genomic dark matter: a critical assessment of the performance of homology search methods on noncoding RNA.

Homology search is one of the most ubiquitous bioinformatic tasks, yet it is unknown how effective the currently available tools are for identifying noncoding RNAs (ncRNAs). In this work, we use reliable ncRNA data sets to assess the effectiveness of methods such as BLAST, FASTA, HMMer, and Infernal. Surprisingly, the most popular homology search methods are often the least accurate. As a resul...

Measuring the Average Accuracy Performance of Homology Search

Homology searches of genomic databases are the essential step for new gene discovery. While novel search techniques are being developed to address this need, it is equally important to provide a unified experimental methodology for evaluating and comparing the different homology search methods. Current evaluation measures such as the Receiver Operating Characteristic (ROC) [5], the Coverage Vers...

HorA web server to infer homology between proteins using sequence and structural similarity

The biological properties of proteins are often gleaned through comparative analysis of evolutionary relatives. Although protein structure similarity search methods detect more distant homologs than purely sequence-based methods, structural resemblance can result from either homology (common ancestry) or analogy (similarity without common ancestry). While many existing web servers detect struct...

SIMPRO: simple protein homology detection method by using indirect signals

MOTIVATION Detecting homologous proteins is one of the fundamental problems in computational biology. Many tools to solve this problem have been developed, but development of a simple, effective and generally applicable method is still desirable. RESULTS We propose a simple but effective information retrieval approach, named SIMPRO, to identify homology relationship between proteins. The key ...

Rapid similarity search of proteins using alignments of domain arrangements

MOTIVATION Homology search methods are dominated by the central paradigm that sequence similarity is a proxy for common ancestry and, by extension, functional similarity. For determining sequence similarity in proteins, most widely used methods use models of sequence evolution and compare amino-acid strings in search for conserved linear stretches. Probabilistic models or sequence profiles capt...

Publication date: 2003